Word Sense Disambiguation Using WordNet Relations

نویسندگان

  • Kostas Fragos
  • Yannis Maistros
  • Christos Skourlas
چکیده

In this paper, the “Weighted Overlapping” Disambiguation method is presented and evaluated. This method extends the Lesk’s approach to disambiguate a specific word appearing in a context (usually a sentence). Sense’s definitions of the specific word, “Synset” definitions, the “Hypernymy” relation, and definitions of the context features (words in the same sentence) are retrieved from the WordNet database and used as an input of our Disambiguation algorithm. More precisely, for each sense of the word a sense bag is formed using the WordNet definition and the definitions of all the “Hypernyms” associated with the nouns and verbs in the sense’s definition. A similar technique is used, for all the context words and the definitions of the “Hypernyms” (associated with the context nouns and verbs), to form a context bag. Then, a technique of assigning weights to words is applied. The weight for every word is inversely proportional to the hierarchy depth in the WordNet taxonomy of the associated “synset”. Eventually, the disambiguation of a word in a context is based on the calculation of the similarity between the words of the sense bags and the context bag. The proposed method is evaluated in disambiguating all the nouns for all the sentences in the Brown files.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

Construction of Semantic Relations for Enhancing Word Sense Disambiguation in Question Answering Systems

Word sense disambiguation is a significant problem at the lexical level of natural language processing. The philosophy is to determine the meaning of a word in a particular usage, by using sense similarity and syntactic context with corpus evidence as well as semantic relations from WordNet. A training set will be constructed for each word tag (using the corpus). Each training example is repres...

متن کامل

Word Sense Disambiguation Using Semantic Graph

This work describes a method of word sense disambiguation by finding similar words in a text. We have used some characteristic properties of the text and its constituent words for the disambiguation task. Using the WordNet, the algorithm constructs a semantic structure on the text illustrating the relations among the words of the text. This structure is then used for disambiguating the constitu...

متن کامل

Using Relevant Domains Resource for Word Sense Disambiguation

This paper presents a new method for Word Sense Disambiguation based on the WordNet Domains lexical resource [4]. The underlaying working hypothesis is that domain labels, such as ARCHITECTURE, SPORT and MEDICINE provide a natural way to establish semantic relations between word senses, that can be used during the disambiguation process. This resource has already been used on Word Sense Disambi...

متن کامل

Improvements To Monolingual English Word Sense Disambiguation

Word Sense Disambiguation remains one of the most complex problems facing computational linguists to date. In this paper we present modification to the graph based state of the art algorithm In-Degree. Our modifications entail augmenting the basic Lesk similarity measure with more relations based on the structure of WordNet, adding SemCor examples to the basic WordNet lexical resource and final...

متن کامل

Class-based collocations for Word Sense Disambiguation

This paper describes the NMSU-Pitt-UNCA word-sense disambiguation system participating in the Senseval-3 English lexical sample task. The focus of the work is on using semantic class-based collocations to augment traditional word-based collocations. Three separate sources of word relatedness are used for these collocations: 1) WordNet hypernym relations; 2) cluster-based word similarity classes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003